A brief introduction to sparklyr

@TiffanyTimbers / @UBC

06/14/2019

What to do when code is slow?

Attribution: Javier Luraschi’s talk slides from SDSS 2019

Scaling Out with R and Spark

Attribution: Javier Luraschi’s talk slides from SDSS 2019

How and where can you use Spark?

Source: Zaharia et al. (2016). Apache Spark: A Unified Engine for Big Data Processing

Example cloud platform

  • Amazon EMR

Simple platform for getting started with Spark (and sparklyr)